AITopics | robust stochastic operator

A Family of Robust Stochastic Operators for Reinforcement Learning

Neural Information Processing Systems | Dec-25-2025, 20:01:50 GMT

We consider a new family of stochastic operators for reinforcement learning with the goal of alleviating negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing that our family of operators preserve optimality and increase the action gap in a stochastic sense. Our empirical results illustrate the strong benefits of our robust stochastic operators, significantly outperforming the classical Bellman operator and recently proposed operators.

name change, reinforcement learning, robust stochastic operator, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Bellman operator convergence enhancements in reinforcement learning algorithms

Kadurha, David Krame, Moutouo, Domini Jocema Leko, Gaba, Yae Ulrich

arXiv.org Artificial Intelligence | May-21-2025

This paper reviews the topological groundwork for the study of reinforcement learning (RL) by focusing on the structure of state, action, and policy spaces. We begin by recalling key mathematical concepts such as complete metric spaces, which form the foundation for expressing RL problems. By leveraging the Banach contraction principle, we illustrate how the Banach fixed-point theorem explains the convergence of RL algorithms and how Bellman operators, expressed as operators on Banach spaces, ensure this convergence. The work serves as a bridge between theoretical mathematics and practical algorithm design, offering new approaches to enhance the efficiency of RL. In particular, we investigate alternative formulations of Bellman operators and demonstrate their impact on improving convergence rates and performance in standard RL environments such as MountainCar, CartPole, and Acrobot. Our findings highlight how a deeper mathematical understanding of RL can lead to more effective algorithms for decision-making problems.
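The contraction argument mentioned in this abstract can be illustrated with a small sketch. The 3-state, 2-action MDP below, its transition table `P`, rewards `R`, and the discount `GAMMA` are all hypothetical values chosen for illustration; they are not taken from the paper. The sketch checks numerically that one application of the classical Bellman optimality operator shrinks the sup-norm distance between two action-value tables by at least a factor of `GAMMA`, and that repeated application settles on a fixed point, as the Banach fixed-point theorem predicts.

```python
import random

GAMMA = 0.9  # discount factor (assumed value)

# Hypothetical MDP: P[s][a] lists next-state probabilities, R[s][a] is the reward.
P = [
    [[0.8, 0.2, 0.0], [0.1, 0.0, 0.9]],
    [[0.0, 1.0, 0.0], [0.5, 0.0, 0.5]],
    [[0.3, 0.3, 0.4], [0.0, 0.0, 1.0]],
]
R = [[0.0, 1.0], [0.5, 0.0], [2.0, 1.0]]


def bellman(q):
    """Apply the Bellman optimality operator to an action-value table q[s][a]."""
    v = [max(row) for row in q]  # state values under the greedy policy
    return [
        [R[s][a] + GAMMA * sum(p * v[t] for t, p in enumerate(P[s][a]))
         for a in range(len(P[s]))]
        for s in range(len(P))
    ]


def sup_dist(q1, q2):
    """Sup-norm distance between two action-value tables."""
    return max(abs(x - y) for r1, r2 in zip(q1, q2) for x, y in zip(r1, r2))


rng = random.Random(0)
q_a = [[rng.uniform(-5, 5) for _ in range(2)] for _ in range(3)]
q_b = [[rng.uniform(-5, 5) for _ in range(2)] for _ in range(3)]

# Contraction: one application shrinks the distance by at least a factor GAMMA.
before = sup_dist(q_a, q_b)
after = sup_dist(bellman(q_a), bellman(q_b))
contracts = after <= GAMMA * before + 1e-12

# Fixed-point iteration: repeated application converges to the unique fixed point.
q = q_a
for _ in range(500):
    q = bellman(q)
residual = sup_dist(q, bellman(q))

print(contracts, residual)
```

The alternative Bellman operators investigated in the paper are not reproduced here; the sketch covers only the classical operator.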

artificial intelligence, machine learning, reinforcement learning, (11 more...)

arXiv.org Artificial Intelligence

2505.14564

Country:

Africa > Cameroon (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Africa > South Africa > Gauteng > Pretoria (0.04)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Reviews: A Family of Robust Stochastic Operators for Reinforcement Learning

Neural Information Processing Systems | Jan-26-2025, 05:22:13 GMT

SUMMARY: The paper considers the problem of designing a Bellman-like operator with certain properties: 1) Optimality-preserving property: the greedy policy of the converged action-value function is the optimal policy. The motivation for the action-gap increasing property comes from the result of Farahmand [12], which shows that the distribution of the action gap is a factor in the convergence to the optimal policy. Roughly speaking, when the action gap is large, errors in estimating the action-value function Q become less important. As a result, we might converge to the optimal policy even though the estimated action-value function is far from the optimal one. Bellemare et al. [5] propose some operators that have these properties.
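The role of the action gap described in this summary can be made concrete with a minimal sketch. The table `Q_STAR` below holds hypothetical optimal action values (illustrative numbers, not results from the paper), and `action_gap` and `greedy` are helper names introduced here. The sketch relies on a standard fact: if every entry of an estimate differs from the optimal value by less than half the action gap, the greedy policy is unchanged. It does not implement the robust stochastic operators themselves.

```python
import random

# Hypothetical optimal action values Q*[s][a] for 3 states and 2 actions.
Q_STAR = [[1.0, 3.0], [2.5, 2.0], [0.0, 4.0]]


def greedy(q):
    """Index of the highest-valued action in each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


def action_gap(q):
    """Smallest difference between best and second-best action value over states."""
    gaps = []
    for row in q:
        best, second = sorted(row, reverse=True)[:2]
        gaps.append(best - second)
    return min(gaps)


gap = action_gap(Q_STAR)  # 0.5, determined by the state at index 1
rng = random.Random(0)
eps = 0.49 * gap  # per-entry error bound strictly below gap / 2

# Perturb every entry by at most eps and check that the greedy policy survives.
stable = True
for _ in range(1000):
    noisy = [[x + rng.uniform(-eps, eps) for x in row] for row in Q_STAR]
    stable = stable and greedy(noisy) == greedy(Q_STAR)

print(gap, stable)
```

A larger gap tolerates a larger `eps`, which is the intuition behind operators that increase the action gap.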

action-value function, bellman operator, operator, (11 more...)

Neural Information Processing Systems

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.50)

Reviews: A Family of Robust Stochastic Operators for Reinforcement Learning

Neural Information Processing Systems | Jan-26-2025, 05:22:03 GMT

The paper proposes a family of robust stochastic operators for RL. This is quite original and potentially impactful. The reviewers raised important questions regarding the clarity of the proofs, which were generally answered in the rebuttal. I also read the paper. It makes an important and original contribution.

reinforcement learning, reviewer, robust stochastic operator, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

A Family of Robust Stochastic Operators for Reinforcement Learning

Neural Information Processing Systems | Oct-10-2024, 16:13:02 GMT

We consider a new family of stochastic operators for reinforcement learning with the goal of alleviating negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing that our family of operators preserve optimality and increase the action gap in a stochastic sense. Our empirical results illustrate the strong benefits of our robust stochastic operators, significantly outperforming the classical Bellman operator and recently proposed operators.

reinforcement learning, robust stochastic operator

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

A Family of Robust Stochastic Operators for Reinforcement Learning

Lu, Yingdong, Squillante, Mark, Wu, Chai Wah

Neural Information Processing Systems | Mar-19-2020, 03:04:07 GMT

We consider a new family of stochastic operators for reinforcement learning with the goal of alleviating negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing that our family of operators preserve optimality and increase the action gap in a stochastic sense. Our empirical results illustrate the strong benefits of our robust stochastic operators, significantly outperforming the classical Bellman operator and recently proposed operators. Papers published at the Neural Information Processing Systems Conference.

reinforcement learning, robust stochastic operator

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

A General Family of Robust Stochastic Operators for Reinforcement Learning

Lu, Yingdong, Squillante, Mark S., Wu, Chai Wah

arXiv.org Machine Learning | May-28-2019

We consider a new family of operators for reinforcement learning with the goal of alleviating the negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing on a sample path basis that our family of operators preserve optimality and increase the action gap. Our empirical results illustrate the strong benefits of our family of operators, significantly outperforming the classical Bellman operator and recently proposed operators.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

1805.08122

Country: